What is claimed is: 

1. A questionnaire analysis system comprising: 

means for inputting a questionnaire statement including free 
reply description in natural language; 

a network for transmitting a questionnaire reply statement;

a database for accumulating said questionnaire reply 
statements transmitted through said network; and

a text classification engine for reading out said questionnaire 
reply statements from said database and for learning a rule for 
classifying said questionnaire reply statement. 

2. A questionnaire analysis system comprising: 

means for inputting a questionnaire statement including free 
reply description in natural language; 

a database for accumulating said questionnaire reply 
statement; and 

a text classification engine for reading out said questionnaire 
reply statement from said database and for learning a rule for 
classifying said questionnaire reply statement. 

3. A questionnaire analysis system comprising: 

means for inputting a questionnaire statement including free 
reply description in natural language; 

a network for transmitting said questionnaire reply 
statement; 

a database for accumulating said questionnaire reply 
statement transmitted through said network; 

a text classification engine for reading out said questionnaire 
reply statement from said database and for learning a rule for 
classifying said questionnaire reply statement; and 

means for distributing said rule through said network 
according to a request from a claimant.

4. The questionnaire analysis system according to claim 1, 
wherein said text classification engine includes: 

morpheme analysis means for analyzing morphemes in all 
sentences in said questionnaire reply statement accumulated in 
said database; 

category-text designating means for designating said 
category and text; 

attribute selecting means for selecting attributes in plural 
questionnaire reply statements being read out from said 
database; 

rule learning means for learning said rule for expressing said 
correspondence of text and category on the basis of said words 
selected by attributes by said attribute selecting means; and 

rule output means for issuing said rule learned by said rule 
learning means. 

5. The questionnaire analysis system according to claim 2, 
wherein said text classification engine includes: 

morpheme analysis means for analyzing morphemes in all 
sentences in said questionnaire reply statement accumulated in 
said database; 

category-text designating means for designating said 
category and text; 

attribute selecting means for selecting attributes in plural 
questionnaire reply statements being read out from said 
database; 

rule learning means for learning said rule for expressing said 
correspondence of text and category on the basis of said words 
selected by attributes by said attribute selecting means; and

rule output means for issuing said rule learned by said rule
learning means. 

6. The questionnaire analysis system according to claim 3, 
wherein said text classification engine includes: 

morpheme analysis means for analyzing morphemes in all 
sentences in said questionnaire reply statement accumulated in 
said database; 

category-text designating means for designating said 
category and text; 

attribute selecting means for selecting attributes in plural 
questionnaire reply statements being read out from said 
database; 

rule learning means for learning said rule for expressing said 
correspondence of text and category on the basis of said words 
selected by attributes by said attribute selecting means; and 

rule output means for issuing said rule learned by said rule 
learning means. 

7. The questionnaire analysis system according to claim 4, 
wherein said attribute selecting means computes a difference
$\Delta SC(\omega)$ between a stochastic complexity (SC) of a text set
without consideration of appearance of a word and a stochastic
complexity (SC) of a text set with consideration thereof, in each
word $\omega$ appearing in said text, and then selects said difference
$\Delta SC(\omega)$ as an attribute when said difference
$\Delta SC(\omega)$ is larger than said threshold $\tau$.
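The attribute selection recited in claim 7 can be illustrated with a minimal sketch. It assumes, beyond what the claim states, that each text is represented as a set of words, that category labels are 0 or 1, and that SC is approximated by a common Bernoulli-model formula (data length times empirical entropy, plus a half-log model cost). The function names and the threshold value are hypothetical, and the sketch reads the selection step as keeping the words whose difference exceeds the threshold.

```python
import math

def stochastic_complexity(labels):
    """Approximate SC (in bits) of a 0/1 label sequence under a Bernoulli model.

    Assumed approximation: m * H(m1/m) + 0.5 * log2(m * pi / 2).
    """
    m = len(labels)
    if m == 0:
        return 0.0
    p = sum(labels) / m
    if p in (0.0, 1.0):
        entropy = 0.0
    else:
        entropy = -(p * math.log2(p) + (1 - p) * math.log2(1 - p))
    return m * entropy + 0.5 * math.log2(m * math.pi / 2)

def delta_sc(texts, labels, word):
    """SC ignoring the word minus SC when texts are split by the word's presence."""
    with_word = [c for t, c in zip(texts, labels) if word in t]
    without_word = [c for t, c in zip(texts, labels) if word not in t]
    split_sc = stochastic_complexity(with_word) + stochastic_complexity(without_word)
    return stochastic_complexity(labels) - split_sc

def select_attributes(texts, labels, tau):
    """Keep every word whose SC difference is larger than the threshold tau."""
    vocabulary = set().union(*texts)
    return sorted(w for w in vocabulary if delta_sc(texts, labels, w) > tau)

# Hypothetical toy data: four replies, label 1 = complaint.
texts = [{"slow", "delivery"}, {"slow", "service"},
         {"great", "service"}, {"great", "price"}]
labels = [1, 1, 0, 0]
print(select_attributes(texts, labels, tau=2.0))  # ['great', 'slow']
```

In this toy data "service" appears equally often in both categories, so splitting on it does not shorten the description of the labels and it is not kept.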

8. The questionnaire analysis system according to claim 5, 
wherein said attribute selecting means computes a difference
$\Delta SC(\omega)$ between a stochastic complexity (SC) of a text set
without consideration of appearance of a word and a stochastic
complexity (SC) of a text set with consideration thereof, in each
word $\omega$ appearing in said text, and then selects said difference
$\Delta SC(\omega)$ as an attribute when said difference
$\Delta SC(\omega)$ is larger than said threshold $\tau$.

9. The questionnaire analysis system according to claim 6, 
wherein said attribute selecting means computes a difference
$\Delta SC(\omega)$ between a stochastic complexity (SC) of a text set
without consideration of appearance of a word and a stochastic
complexity (SC) of a text set with consideration thereof, in each
word $\omega$ appearing in said text, and then selects said difference
$\Delta SC(\omega)$ as an attribute when said difference
$\Delta SC(\omega)$ is larger than said threshold $\tau$.

10. The questionnaire analysis system according to claim 4, 
wherein said rule learning means: 

forms said text set by replacing with an expression of
$(d_1, c_1), (d_2, c_2), \ldots, (d_m, c_m)$ [where each $d_i$ is a
multi-valued discrete vector
$d_i = (\omega_{i1}, \omega_{i2}, \ldots, \omega_{in})$ $(i = 1, \ldots, m)$,
$\omega_{ij}$ is 1 when word obtained by attribute selection
$\omega_j$ $(j = 1, \ldots, n)$ appears in said i-th text, or 0
otherwise, $c_i$ expresses said value (label) of said category
according to said i-th text and each $c_i$ is 1 when belonging to a
specific category, or 0 otherwise, and $m$ is said number of texts];

selects said rules of if-then-else format and sequentially adds 
said selected rules to said stochastic decision list by employing 
said information quantity standard such as said extended 
stochastic complexity (SC) minimum principle or SC 
minimizing principle; and 

removes said rules one by one from said last one of said 
stochastic decision list, and clips continuously until none should 
be removed from said viewpoint of said extended SC minimum
principle. 
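The three steps of claim 10 (encoding texts as 0/1 vectors, growing a decision list rule by rule, then pruning from the last rule) can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed method: the rules carry a plain label rather than a probability, growth picks the purest remaining attribute, and the `cost` function is a crude stand-in (training errors plus a fixed per-rule charge) for the extended stochastic complexity, which is not reproduced here. All names and the `rule_cost` value are hypothetical.

```python
def encode(texts, attributes):
    """Turn each text (a set of words) into a 0/1 vector over the selected words."""
    return [tuple(int(w in t) for w in attributes) for t in texts]

def majority(labels):
    """Most frequent 0/1 label; ties go to 0."""
    return int(2 * sum(labels) > len(labels))

def predict(rules, default, d):
    """Apply 'if word j appears then label, else ...' in order; fall back to default."""
    for j, label in rules:
        if d[j] == 1:
            return label
    return default

def cost(rules, default, data, rule_cost=1.5):
    """Stand-in criterion (NOT extended SC): training errors + charge per rule."""
    errors = sum(predict(rules, default, d) != c for d, c in data)
    return errors + rule_cost * len(rules)

def grow(data):
    """Sequentially add the rule with the fewest errors on still-uncovered texts."""
    rules, remaining = [], list(data)
    unused = set(range(len(data[0][0]))) if data else set()
    while remaining and unused:
        best = None
        for j in sorted(unused):
            covered = [c for d, c in remaining if d[j] == 1]
            if not covered:
                continue
            label = majority(covered)
            errors = sum(c != label for c in covered)
            if best is None or errors < best[0]:
                best = (errors, j, label)
        if best is None:
            break
        _, j, label = best
        rules.append((j, label))
        unused.discard(j)
        remaining = [(d, c) for d, c in remaining if d[j] == 0]
    leftover = [c for _, c in remaining] or [c for _, c in data]
    return rules, majority(leftover)

def prune(rules, default, data):
    """Drop rules from the end while doing so does not raise the cost."""
    rules = list(rules)
    while rules and cost(rules[:-1], default, data) <= cost(rules, default, data):
        rules.pop()
    return rules

# Hypothetical toy data over the attributes ("slow", "great").
data = [((1, 0), 1), ((1, 0), 1), ((0, 1), 0), ((0, 1), 0), ((0, 1), 0)]
rules, default = grow(data)
print(rules, default)               # [(0, 1), (1, 0)] 0
print(prune(rules, default, data))  # [(0, 1)]
```

In the toy run the second rule is pruned because the default label already classifies the texts it covers, so removing it lowers the stand-in cost without adding errors.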

11. The questionnaire analysis system according to claim 5,
wherein said rule learning means:

forms said text set by replacing with an expression of
$(d_1, c_1), (d_2, c_2), \ldots, (d_m, c_m)$ [where each $d_i$ is a
multi-valued discrete vector
$d_i = (\omega_{i1}, \omega_{i2}, \ldots, \omega_{in})$ $(i = 1, \ldots, m)$,
$\omega_{ij}$ is 1 when word obtained by attribute selection
$\omega_j$ $(j = 1, \ldots, n)$ appears in said i-th text, or 0
otherwise, $c_i$ expresses said value (label) of said category
according to said i-th text and each $c_i$ is 1 when belonging to a
specific category, or 0 otherwise, and $m$ is said number of texts];

selects said rules of if-then-else format and sequentially adds
said selected rules to said stochastic decision list by employing
said information quantity standard such as said extended
stochastic complexity (SC) minimum principle or SC
minimizing principle; and

removes said rules one by one from said last one of said
stochastic decision list, and clips continuously until none should
be removed from said viewpoint of said extended SC minimum
principle.

12. The questionnaire analysis system according to claim 6, 
wherein said rule learning means: 

forms said text set by replacing with an expression of
$(d_1, c_1), (d_2, c_2), \ldots, (d_m, c_m)$ [where each $d_i$ is a
multi-valued discrete vector
$d_i = (\omega_{i1}, \omega_{i2}, \ldots, \omega_{in})$ $(i = 1, \ldots, m)$,
$\omega_{ij}$ is 1 when word obtained by attribute selection
$\omega_j$ $(j = 1, \ldots, n)$ appears in said i-th text, or 0
otherwise, $c_i$ expresses said value (label) of said category
according to said i-th text and each $c_i$ is 1 when belonging to a
specific category, or 0 otherwise, and $m$ is said number of texts];

selects said rules of if-then-else format and sequentially adds
said selected rules to said stochastic decision list by employing
said information quantity standard such as said extended
stochastic complexity (SC) minimum principle or SC
minimizing principle; and

removes said rules one by one from said last one of said
stochastic decision list, and clips continuously until none should
be removed from said viewpoint of said extended SC minimum
principle.

13. A computer program product for analyzing
questionnaire replies, which comprises:

a morpheme analysis procedure for analyzing morphemes in
all sentences in said questionnaire reply statements
accumulated in a database;

a category-text designating procedure for designating said
category and text in said text classification engine;

an attribute selecting procedure for selecting attributes in
plural questionnaire reply statements being read out from said
database;

a rule learning procedure for learning said rule for expressing
said correspondence of text and category on the basis of said
words selected by attributes by said attribute selecting
procedure; and

a rule output procedure for issuing said rule learned by said
rule learning procedure.